Survey of execution monitoring tools for computer clusters
نویسندگان
چکیده
Effective management and utilization of large computer clusters requires suitable management tools. This includes tools for monitoring execution, both in real-time (online) and analysis of traces after execution (off-line). Execution monitoring also involves measuring usage of system resources, such as CPU, memory and network. As cluster monitoring is tightly related to administration, some of the tools included in this survey is actually general cluster administration tools.
منابع مشابه
A Clustering Approach to Scientific Workflow Scheduling on the Cloud with Deadline and Cost Constraints
One of the main features of High Throughput Computing systems is the availability of high power processing resources. Cloud Computing systems can offer these features through concepts like Pay-Per-Use and Quality of Service (QoS) over the Internet. Many applications in Cloud computing are represented by workflows. Quality of Service is one of the most important challenges in the context of sche...
متن کاملAn Exploratory Survey of Hadoop Log Analysis Tools
In view of the fact that clusters used in large scale computing are on the rise, ensuring the wellbeing of these clusters is of paramount significance. This highlights the importance of supervising and monitoring the cluster. In this regard, many tools have been contributed that can efficiently monitor the Hadoop cluster. The majority of these tools congregates necessary information from each o...
متن کاملCondition Monitoring Techniques of Power Transformers: A Review
Power transformers provide a vital link between the generation and distribution of produced energy. Such static equipment is subjected to abuse during operation in generation and distribution stations and leads to catastrophic failures. This paper reviewed the techniques in the field of condition monitoring of power transformers in recent years. Transformer monitoring and diagnosis are the effe...
متن کاملA high performance scientific cloud computing environment for materials simulations
We describe the development of a scientific cloud computing (SCC) platform that offers high performance computation capability. The platform consists of a scientific virtual machine prototype containing a UNIX operating system and several materials science codes, together with essential interface tools (an SCC toolset) that offers functionality comparable to local compute clusters. In particula...
متن کاملSurvey of optimizing techniques for parallel programs running on computer clusters
In the current field of high performance computing, clusters technologies plays an ever increasing role. This paper tries to summarize state-of-the techniques for optimization of parallel programs designed to run on computer clusters. Optimizing parallel programs is a much harder task than optimizing sequential programs due to the increased complexity caused be communication and synchronization...
متن کامل